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ABSTRACT. We study the diagonal heat-kernel decay for the four-dimensional nearest-neighbor 
random walk (on Z 4 ) among i.i.d. random conductances that are positive, bounded from above but 
can have arbitrarily heavy tails at zero. It has been known that the quenched return probability 
P 2 i J ! (0,0) after 2n steps is at most C(o))n~ 2 logn, but the best lower bound till now has been 
C(a>)n~ 2 . Here we will show that the log« term marks a real phenomenon by constructing an 
environment, for each sequence X n —¥ °°, such that 

P^(0,0)>C(£0)log(«)n- 2 /A„, 

with C(co) > a.s., along a deterministic subsequence of n's. Notably, this holds simultaneously 
with a (non-degenerate) quenched invariance principle. As for the d > 5 cases studied earlier, the 
source of the anomalous decay is a trapping phenomenon although the contribution is in this case 
collected from a whole range of spatial scales. 



1. Introduction and Results 

Recent years have witnessed remarkable progress in the understanding of a class of reversible 
random walks in random environments that go under the name Random Conductance Model. The 
setting of a typical instance of this problem is as follows: Consider the d-dimensional hypercubic 
lattice Z d and let M d denote the set of unordered nearest-neighbor pairs. For a configuration 
ft) = (QOb)ben d e (0,°°) Brf , define the Markov chain X = (X n ) n >o with state space Z d and transition 
probability 

10, otherwise, 

where 

K m (x) := £ a^. (1.2) 

y: (x,y)eB d 

Sometimes even (Ob = is permitted; the state space is then just {x: n m (x) > 0} or, when such 
exists, an infinite connected component thereof. Let denote the distribution of X subject to 
P'a{Xo = x) = 1. The principal items of interest are various asymptotics of the law of X under P^ 
in the situation when ft) is a sample from a probability distribution P. 
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Much of the early effort by probabilists concerned the validity of the (functional) Central 
Limit Theorem. In a sequence of papers (Kipnis and Varadhan [27], De Masi, Ferarri, Gold- 
stein and Wick |[T9l |20l , Sidoravicius and Sznitman [34], Berger and Biskup [9], Mathieu and 
Piatnitski OTl . Mathieu [301, Biskup and Prescott [13], Barlow and Deuschel Q, Andres, Bar- 
low, Deuschel and Hambly 0]]), it has gradually been established that, as n — > oo, the law of 
t h-» Xufi / ^fn under P° scales to a non-degenerate Brownian motion for almost every environ- 
ment CO, provided that certain conditions are met by the law of oo. For i.i.d. laws F concentrated 
on [0, oo) B '' , in d > 2 these conditions are 

E(flfc) < oo and F{oo b > 0) > Pc (d), (1.3) 

where E denotes the expectation in F and p c {d) is the critical threshold for bond percolation 
on l/. In d = 1 the second condition needs to be replaced by E(g)^ 1 ) < oo; independence is not 
required (e.g., Biskup and Prescott |[T3l ). The same conditions as in d = 1 are sufficient to imply 
the quenched CLT in d = 2 for general ergodic environments (Biskup ifTTI ). 

While the proof of the functional CLT is remarkably soft for the law on path space that is 
averaged over the environment — the so called annealed or averaged law — the almost-sure or 
quenched law generally requires also the heat kernel upper bound, 

P l^y)^i^l2^ X ~ y?,n ' n>N(00,x,y). (1.4) 

This is conceptually wrong as the CLT seems to require a local-CLT type of estimate. Notwith- 
standing, for environments possessing some form of uniform ellipticity, these heat-kernel upper 
bounds can be obtained. Indeed, they are the results of the sequence of papers by Delmotte lfT8l . 
Benjamini and Mossel [8], Heicklen and Hoffman [26], Mathieu and Remy [32] culminating in 
Barlow's work [3 ] for the simple random walk on the supercritcal percolation cluster. (We regard 
this case as uniformly elliptic too although on a spatially inhomogeneous graph.) However, in 
the environments with heavy tails at zero, it was in fact discovered that (11.41 ) may fail (Fontes 
and Mathieu l23l . Berger, Biskup, Hoffman and Kozma iPTOl ) and a coarse-graining procedure 
was required to overcome this difficulty and derive the quenched functional CLT (Mathieu l30l . 
Biskup and Prescott |[T3l ). We note that when the left condition in (11.31) fails, the scaling limit 
of X may be not be diffusive at all (Barlow and Cerny [4], Barlow and Zheng [7 ], Cerny IfTTI ). 

The study ifTUIl presents two types of results. First, for i.i.d. environment laws bounded from 
above, it restricts the diagonal heat-kernel decay by the following estimates 

V d / 2 , d = 2,3, 

n~ 2 logn, d = 4, (1.5) 

n~ 2 , d>5, 

where C{oo) < °° almost surely, with the additional observation, 

« 2 P^ l (0,0) >0 P-a.s. ind>5. (1.6) 

Second, for any sequence X n f °°, an i.i.d. environment law is constructed so that 

P 2 ^(0,0)>^4, k>l, (1.7) 



P2,(o,o)<c(o) { 
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along a deterministic sequence — >• °°, where C((o) > almost surely. Since the Central Limit 
Theorem for X n holds, we also have 

P-(0,0)>^, (1.8) 

cf, e.g., 031 Remark 2.2]. 

Putting the bits and pieces together we conclude that the return probability P^ ! (0,0) always 
decays diffusively in spatial dimensions d = 2,3, while, in dimensions d > 5, it can decay as 
slow as o(n~ 2 ). (In d = I, the decay can be arbitrarily slow.) Further progress has been made by 
Boukhadra lfT4l[T5l who showed that the transition from regular decay n~ d l 2 to anomalous decay 
n -2+o(i) i n j > 5 actually occurs in the class of power-law tails — with the exponent y = d/2 in 
P(0 < (Ob < s) ~ s r being presumably the critical for the anomaly to appear. In d > 5 this meshes 
nicely with the annealed estimates obtained by Fontes and Mathieu |[23l . 

The combined results of lfl0l[T4l[T5l provide definitive answers in all spatial dimensions except 
d = 4, where (11.51 ) and (11.71) differ by a logarithmic factor. Computations for time-dependent 
environments (cf. Theorem 5.3 of iflOl ) suggested that (11.5b is presumably the one closer to the 
truth, but any feasible method of proof seemed to require control of off-diagonal heat-kernel 
lower bounds. This would seem in turn to demand — in order to avoid circuitous reasoning — 
running a complicated induction along scales. We are thus pleased to report on a conceptually 
straightforward, albeit still technically complicated, proof of the following theorem: 

Theorem 1.1 Assume d = 4. For every sequence {A,,} with A„ f °°» there exists an i.i.d. environ- 
ment law P with P(0 < (Ob < 1) = 1, a random variable C{(o) with P(C > 0) = 1 and a sequence 
m — > oo such that for every n £ {n{\, 

p 2 ;(o,o)>cw^. (i.9) 

The conclusion is that the possibility of anomalous heat-kernel decay in the random conduc- 
tance model extends to all dimensions d > 4; yet in d = 4 the correction to diffusive behavior 
is only logarithmic. We actually believe, although cannot prove, that similarly to d > 5 the up- 
per bound in (11.5b can be approached arbitrarily closely but will not be attained in any given 
environment. We formulate this as a conjecture: 

Conjecture 1.2 Assume d = 4. For every i.i.d. environment P with P(0 < (Ob < 1) = 1, 

2 

P„ (0,0) >0, P-a.s. (1.10) 



n ~>2n 



log 72 



We point out that the proof of the asymptotic (11.6b in d > 5 does not seem to carry over to this 
case and so presumably a new idea is needed here. Update in revised version: The conjecture 
has in the meantime been proved in an upcoming preprint by Biskup, Louidor, Rozinov and 
Vandenberg-Rodes 021 • A novel input is the use of the Dominated Ergodic Theorem. 

Here is the main idea underlying the proof of Theorem 11.11 In order to ensure that the chain 
returns to its starting point at a required time, we can either let it arrive there more or less by 
accident (the dominating strategy in d = 2, 3) or make it fall into (and hide inside) a specific trap 
nearby which makes a later return to the starting point considerably less difficult (the dominating 



4 



M. BISKUP AND O. BOUKHADRA 



strategy in d > 5). However, in d = 4 the difference between the two strategies is so subtle that 
we can no longer force which trap the chain falls into; it needs to find one by itself. This puts us 
in the class of ideas underlying the approach taken by Boukhadra fT4l [131 to control the d > 5 
anomaly in the class of power-law tails. 

Remarks 1.3 (1) Should one desire to have (11.91) without a random constant, this can be done 
by making the range of &'s random. 

(2) We did not try to optimize the tails of the distribution of a>b for which the anomalous 
behavior occurs in d = 4 although we think it is unlikely to occur in the class of power-law tails. 
On the other hand, our method of proof — being closer to the approach of lPT4l [T5l — could 
conceivably be adapted to yield a proof of a sharp threshold in the exponents that would yield 
anomalous behavior in d > 5. 

(3) As in ifltJll . we made no attempt to derive off-diagonal estimates on the heat kernel. Thus, 
in all cases d > 4, a question remains how one can reconcile the subdiffusive diagonal heat-kernel 
decay with the standard heat-kernel decay that should resume validity (at least on average) at the 
diffusive space-time scale (by the CLT). 

(4) The proof of the upper bounds (11.51 ) in [10] applies even to non-i.i.d. environments that 
possess a "strong" component almost surely. (This can be guaranteed by requiring that the law is 
dominated from below by an i.i.d. law in which edges with positive conductances percolate.) On 
the other hand, it is not hard to construct correlated environments that would make the heat kernel 
decay arbitrarily slowly (in any d > 1). A question remains whether there are some robust (e.g., 
moment) conditions that would imply a "standard" diffusive decay regardless of correlations. 

(5) An interesting question is what strategy dominates the event {X2„ = = Xq} when anoma- 
lous decay occurs. In particular, does the walk visit at most one trap during its run or a sequence 
of traps (with various "strengths" of trapping), etc? Update in revised version: This question has 
also been resolved in the aforementioned preprint [12]; the path does spend a majority of its time 
in a few very localized places. 

(6) It would be interesting to see whether and how the swMiffusivity of the random walk 
among random conductances manifests itself in the behavior of its loop-erasure. Here we note 
that, for instance, in d = 2 the scaling limit of the loop-erasure of the simple random walk on 
the supercritical percolation cluster coincides with that for the ordinary simple random walk 
— namely SLE2 (Yadin and Yehudayoff 11351 ). The question is thus whether one can extend 
this remarkable result to other dimensions (obviously, with a different scaling limit) and other 
conductance laws. 

The plan of the remainder of this paper is as follows. In the next section we collect the ideas 
entering the proof and structure the main steps into proper lemmas and propositions. The proof 
of the main theorem then can be given subject to a Key Lemma (Lemma [2.61 ) that controls the 
number of traps the chain typically sees along its path. The Key Lemma is then subsequently 
reduced to moment bounds on the corresponding number for a coarse-grained chain; this is done 
in Section [3] In Section 0] we then invoke certain technical facts about the heat kernel for the 
coarse-grained chain, and also the trap density, to justify these moment bounds. These technical 
facts are then proved in Section [5] (trap density bounds) and Sections [6][7] (heat-kernel estimates). 
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2. Key steps of the proof 

A distinguished feature of the four-dimensional problem, and the reason why the heat-kernel 
anomaly is manifested only by logarithmic corrections, is that the leading contribution to return 
probability may come from a whole range of spatial scales. Anticipating some form of scale 
invariance, we partition Z rf into a sequence of (disjoint) annuli 

B k :={xeZ d :2 k - 1 -\<\x\ 0O <2 k }, k>0. (2.1) 

Let denote the cardinality of B^. An opening step of the proof is the following version of a 
standard (deterministic) Cauchy-Schwarz estimate. 

Lemma 2.1 Suppose that < (Ob < I for all b. Then 

W[,) - 2d jL \ Bk \ ■ M 
Proof. By the Markov property and reversibility 

P 2 W "(0,0)=£ P" ffl (0,x)P';(x,0) 



.v6L 



2 Mo) 

K a (x) 



(2.3) 



Bounding 7t m (x) < 2d and using that {Bk}k>i form a partition of Z , we get 

Ca>(( 
2J 



P 2 ffl "(0,0)>^£ £P"jO,x) 2 . (2.4) 

k>0xeB k 



By Cauchy-Schwarz, the sum over x exceeds l P®(X„ G Byt) 2 - D 

In order to motivate our next step, we recall a classic argument (cf, e.g., [13, Remark 2.2]) 
that shows how the CLT implies (11.81) . Indeed, for k := L^logiMJ anc ^ 1 we nave that 
diam(B,t) ~ \fn and so the quenched CLT gives 

Pi (X n G B t ) > d (to) > 0, P-a.s. (2.5) 

Retaining only the corresponding term in the sum, from \Bjc\ < C'n d l 2 we get 

P>,0)>^ = ^, (2.6) 
n a l £ d=4 « z 

where C2 > a.s. This is the standard diffusive decay. The key idea underlying our work is that, 
in d = 4 there are environments for which order-log n other k's contribute a comparable amount 
to (12.21) — thus producing a log n multiplicative term. 

In order to state the requisite lower bound on F%(X n G Bk) in dimension-independent form, 
consider the abbreviation 

tk ■= 2 2k , k>0, (2.7) 
and note that this is the diffusive time scale associated with the spatial scale of B^. 
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Proposition 2.2 Let d > 4 and consider an i.i.d. law P satisfying P(0 < (Ot, < 1) = 1 for which 
there exists a sequence ni — > °° such that the quantity 

Pn := PH > V 2 ) P(V„ < (Ob < 2 /n) 4d ~ 2 (2.8) 

obeys 

p ni \ogni >oo. (2.9) 

There are random variables C\ = C\ (ft)) andN\ = N\ (ft)), w/?/j Ci (ft)) > andN\ (ft)) < °° P-O.J., 
swc/j that for all n G {«<?}<?> i with n>N\ (ft)) a«<i all k> 1 satisfying 

gOoglog,!) 2 < ^ < (210) 
log« 

P° m (X n £B k ) >d(ft))p„-. (2.11) 

Remark 2.3 Note that the fact that p„ is summable on n — which is seen, e.g., from the bound 
Pn < P(Vn < ft>fo < 2 ln) — forces us to work with subsequences in (I2.9I ). On the other hand, 
the requirement of subpolynomial decay of p n is convenient, albeit perhaps unnecessary, for our 
proofs. (Specifically, this assumption is used in Lemmas 14.21 and 14.51 and also in the proof of 
(13.171) from Lemmas 14.4144.51 ) All of these steps will need to be reevaluated when studying the 
question for what tails does the anomaly start to occur in d = 4. (In d > 5, this question has been 
addressed by Boukhadra lfl4l[T5l .) 



Proposition I2.2l permits us to finish the proof of our main result. It is important to note that in 

k 



(and only in) d = 4 we have that K ~ |2?jt| which puts all terms in the sum in (12.21) on the same 



order of magnitude. 

Proof of Theorem \l.l\ from Proposition \2.2\ Abbreviate 6 := 1/[2(4J — 2)] and suppose d = 4. 
We may assume without loss of generality that A„ tends to infinity so slowly that 

X-' /2 logn >oo. (2.12) 



n— >°° 



Let {n?}e>i be an increasing sequence of integers with nt > 1 and such that 

1 
2 



ZK 6 <\- (2-13) 



We then define the environment law P to be an i.i.d. measure whose one-dimensional marginals 
are concentrated on {1} U {nj 1 }e>i with probabilities 

P((O b = nj l ) := X- e , l>l, (2.14) 

and 

P(ft) & = l):=l-£A f - e . (2.15) 

l>\ 

Note that, for this environment, 

pl>F(G) b = l) 2 P((D b = njY 4d ' 2) >\K ( \ *>h (2-16) 
and so ( 12.91) is implied for the subsequence {nf\ by (12.121 ) and (12.161) . 



HEAT-KERNEL DECAY IN RANDOM CONDUCTANCE MODELS 7 

Now pick n G {ni}i>\ with n > N\{co), where N\ as in Proposition 12.21 and introduce the 
shorthand 3f(n) := {& G N: e ( lo z lo z") 2 <t k < n/log(n)}. LemmaOand Proposition El imply 

Pj(0 ,o) > *f> £ iSS^ 

> ^) Cl ((D) 2 f^ 2 2- 4 |^(n)L (2.17) 



where we used that t\j\Bk\ > 2 4 . For n > 1, we have |^*(n)| > 2iog 4 7i > jlogrc. Hence, for 
n G {ni}i>i sufficiently large, 

P ^o,o),M^)!0. 

This and the fact that P% (0, 0) > for all n > 1 imply the claim. □ 

It remains to construct the proof of Proposition |2.2j As in the examples showing anomalous 
decay in d > 5, a mechanism that could make P%{X n G Bk) large even when n 3> (which is 
outside the central-limiting scaling) is to let the walk fall into a trap. In analogy with lfl0l[T4l[T5ll , 
we adopt the following (somewhat arbitrary) definition: 

Definition 2.4 A trap at scale n is an edge b = (y, z) such that (Ob > V2 and such that for any 
edge b' ^b incident with either y or z, 

-<<%<-■ (2.19) 
n n 

Let s^ n (jc) be the event on the space of environments that x is a vertex neighboring a trap edge 
at scale n. Let us abbreviate 

B° h := {x G 1/ : 2 k ^ + 2 < |x|„ < 2* - 3} (2.20) 

and note that B° k C Bj: and, in fact, dist(B£,B£) > 3. In particular, if srf n (x) occurs for x G B° k , then 
the corresponding trap(s) and the edges incident therewith all lie in B^. The effect of trapping is 
captured by the next estimate: 



Lemma 2.5 Let d > 1. There is an absolute constant c\ = c\ (d) > such that for all n,lc> 1, 

P° a (X n efii)>V £ l { x f eB i} U(x c )) • (2-21) 
" V e =o / 

Proof. For x£*L d such that srf n (x) occurs, let (y, z) be the trap edge that makes stf n (x) occur. (In 
the presence of more such edges next to x, we pick the one that is smallest in a fixed complete 
order on M d .) We assume that this edge is labeled so that x and y are neig hbors in Z d . For I > 0, 
we use D n (x, £) to denote the event 

n 

D n {x,£):={X e =x}n{X £+1 =y}n f| {X m e{y,z}}. (2.22) 

m=l+l 



8 



M. BISKUP AND O. BOUKHADRA 



First we note that 

{X n eB k }D |J |J D n {x,£). (2.23) 

&/„ (x) occurs 

Indeed, on D n (x, £) (with x and £ in the unions above) the walk at time n is at one of the endpoints 
of the trap, which are both in B k by the restriction x 6 B° k . 

Next we claim that the unions in (12.231 ) are disjoint, i.e., D n (x,£) PiD n (x' ,£') = for any pairs 
of indices (x, £) / (xf , £') contributing to (12.23b - This is because on D n (x,£), the walk spends more 
than half of its time crossing a single (trap) edge — namely, (y,z) in (12.22b - This walk must have 
entered the trap from vertex x at time £ and so if (x 1 ,£') is distinct from (x,£), it cannot belong to 
D n (x',£'). We conclude 

n/2-1 

lt(X n EB k ) > £ £ P° m (D n (x,£)). (2.24) 

xeB° £=0 

The Markov property and a simple calculation imply 

P° m (D n (x,£)) >/to=x)^(l + l^-^y \ (2.25) 

The last two terms are at most ci/n for c\ := (2d)~ 1 e~ 4 ( 2d ~ l \ Once (12.251 ) is used for all terms 
in (12.24b . the sums combine into the desired expectation. □ 

The proof of Proposition I2.2l is now reduced to the following Key Lemma: 



Lemma 2.6 (Key Lemma) Let d > 4. For any i.i.d. law F satisfying P(0 < (Ob < 1) = 1 a«ci 
(12.91 ) for a sequence {«^}f>i, f/iere are F-a.s. finite and positive random variables C2 '■= C2(co) 
and N[ = Ni ((d) such that for all n € {w^}c>i with n>N\ and all k obeying (12.101 ) we have 

/n/2-l \ 
E l[ E l{x f eB|}l<( Xf ) j >C 2 (ffl)AA. (2-26) 

Proof of Proposition 12. 2\ from Key Lemma. The conditions on and « are identical, and combining 
<E26j with (E2D we get (EH]) with Ci(ffl) :=c\C 2 ((0). □ 



3. Proof of Key Lemma 

Our proof of the Key Lemma will require introduction of some technical tools that we will first 
try to motivate by giving a heuristic argument why (12.261 ) should hold true. 

Recall the notation fj from (12.7b . By reducing the sum in (12.26b to t k < £ < 2t^ — which is 
allowed because tk <C n by the assumptions (12.10b — the expectation in (12.26b pertains to paths 
of the random walk on temporal scale and spatial scale This is a diffusive scaling so one 
might expect that the law of Xg will be already close to the stationary distribution, and thus more 
or less uniformly distributed, over B° k . The expectation of each term in the (reduced) sum should 



HEAT-KERNEL DECAY IN RANDOM CONDUCTANCE MODELS 



9 



therefore be bounded below by a constant times P(^(0)). As 

P«(0))>p„, (3.1) 

and as there are order t% terms in the (reduced) sum, this would yield (12.26b . 

A fundamental problem with this reasoning is that, due to the presence of very weak bonds, the 
law of X( in B° k for tk<£< 2tu will not be close to the stationary distribution at the required level. 
After all, the sole purpose of this note is to demonstrate the failure of a local-CLT scaling! As 
in lfT^[T0l[30l[T4l[T5ll . we will circumvent this problem by observing the walk only on a strong 
component; namely, the connected component of edges b with (Ob> ct for some small enough a 
to be chosen momentarily. This walk already has good mixing properties but, unfortunately, the 
reduction of the expectation in (12.261 ) to this walk involves a time change that will now need to be 
controlled as well. And as this happens on the background of an expectation of a (large) random 
variable, we will have to control moments of this random variable as well. 

We now begin to formulate the aforementioned technical aspects precisely. Following up on 
earlier work lfl3l[l0l[T4l[l5l . we will introduce a cutoff a and examine the connectivity properties 
of the graph Go with vertices Z d and edges {b: (Ob > cc}. The key facts we will need are as 
follows: 

Proposition 3.1 Assume d > 2. Then there is po = po{d) G (0, 1) such that whenever P(ft)/, > 
oc) > Po, then the following holds F-a.s.: 

(1) The graph G w contains a unique infinite connected component ^»,a = ^oo. a (ft)). 

(2) The complement 7L d \ ff^a has only finite connected components. 

If ^ x denotes the connected component of7h d \ c €< x>M containing x(with ^ x = (dforx G ffcc.a) and 
dist ffl (x,y) is the shortest-path distance measured on then also: 

(3) Almost surely on {0 G ^.a}, 

dist^O,*) 

hmsup — < oo. (3.2) 

|jt|— >o° \ X \ 

(4) 7/'diam ft) (J^ v ) denotes the maximum of dist ffl (y, z) over all pairs of Z d -neighbors y,z G 
^..a of^ x , then diam ft) (^o) has all moments. (Naturally, dian^O) = 0.) 

Finally, let & x denote the union of ^ y fory running through neighbors ofx in Let G' m denote 
the graph obtained from ff^a by adding an edge between any y,z G with c S y T\ c S z / and let 
d' m (x,y) denote the graph-theoretical distance measured on G' m . Then: 

(5) For some £, > 0, 

limsup-i-logPfo^G^a&d^^,^;) <£,\x\) < 0. (3.3) 
Here and henceforth, \x\ denotes the Euclidean norm ofx. 

Proof (Sketch). First note that all properties (2-5) are properly stochastically monotone in a. (An 
exception is the uniqueness of ^o.a which holds for all a by an argument of, e.g., Burton and 
Keane iToll .) Our proof is best explained by running a coarse-graining argument: Consider bond 
percolation on 7L d with parameter p and call a unit cube of 2 d vertices in 7L d occupied if all of its 
edges are occupied. Two cubes are called adjacent if they share a side. By the result of Liggett, 
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Stacey and Schonmann [28], the fact that the cubes more than distance one apart are independent 
permits us dominate the process of occupied cubes from below by site percolation on 7L d with 
a parameter r)(p), where r\(p) f 1 when p 1 1. In particular, there is po G (0, 1) such that for 
all p > po, the occupied cubes percolate and the removal of the (a.s. unique) infinite component 
of occupied cubes results only in finite components whose diameters have an exponential tail. 

Properties (1,2) now follow immediately by standard facts about percolation on Z d while (4) 
is the consequence of the fact that diam ft) (^' A: ) will be bounded by the number of unit cubes 
adjacent to the finite component of the cube-process (necessarily) containing & x . Property (3) is a 
consequence of Theorem 1.1 of while property (5) is a restatement of Lemma 3.1 of lfl3l . □ 

Now let us fix po as in Proposition 13.11 and pick Oo G (0, 1) by 

P(fflfe > «o) > Po- (3-4) 

We will keep Ofo fixed throughout the rest of the paper. Note that the properties (1-5) in Proposi- 
tion l3.1l apply to all cutoffs a G (0, Oo]. 

Consider now a path of the Markov chain X. For any ft) with G we define a sequence 
T :=0,r l5 r 2 ,... via 

Tj+i := M{£ >T + --- + Tj : Xi E tf„ i0 } -(% + — + Tj) (3.5) 

and 

X r .= X Tl+ ... +Te , £>0. (3.6) 

The sequence (Xi)g>\ records the successive visits of (X n ) to the strong component ^. a - Note 
that we have 7) < °° for all j > 0, P°-a.s. In fact, there is a (deterministic) moment bound on the 
time the walk can "hide" in a component of 7L d \ ^L>,«: 

Lemma 3.2 (Hidding time estimate) For x G 7L d , let & x = & x ((o) be as in Proposition li.il 
Set c 2 := 4da-\ Then for all ft) G (0, l] Brf , 

E* a (Ti)<C2\9 x \. (3.7) 



Proof. This is a restatement of Lemma 3.8 from ifTUl . □ 

A fundamental concept in the study of random walks in random environments is the "point of 
view of the particle." The idea is that instead of recording the position of the walk relative to a 
given environment, we follow the sequence of environments that the walker sees along its path. 
Explicitly, let z x denote the "shift by x" which is formally defined by 

(r x (o) yz := C0y +XiZ+x , x G (y, z ) G M d . (3.8) 

Given a trajectory X = (X n ) n >o of the coarse-grained random walk in environment ft) with G 
^»,a(®)> the sequence (x% ft))«>o is itself a Markov chain on the space of environments with 
stationary measure 

Q«H :=Q(-|0 (3.9) 

where 

Q(do) := ^^P(dft)) for Z:=E7T ffl (0). (3.10) 
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Furthermore, since P is ergodic with respect to {T x ) xeIi d, abstract considerations (cf O Section 3]) 
imply that Q a is ergodic with respect to the Markov shift CO i— > CO, where X\ is sampled from P^ 
and CO from Q a . Introduce also the shorthand 

P K (-):=P(-|0e^,«) (3.11) 

and note that Q a ~ P a for all a G (0,ao]- A direct consequence of these constructions and 
Lemma |3T2l is that the time scales of the walkX and the walkX are commensurate: 

Lemma 3.3 For each a G (0, Ofo] there is j8 = p(d, a) G (0,°°) smc/j that for P a -a.e. CO, 

P%(i,K>pn) >0. (3.12) 

Proof. Fix a G (0, Oq] and let j8 be such that 

/3>£ Qa (£°(7i)). (3.13) 

Such a choice is possible because the expectation on the right is finite by Lemma [3^21 the bounds 
M°) < 2d and p (° 6 ^=o,a) > and the fact that E|^| < °o, as implied by Proposition [3JI4). 
The ergodicity of the Markov shift on the space of environments implies that, for P a -a.e. CO, 

yEQ^EKn)), P°-a.s. (3.14) 

The right-hand side is strictly less than j3 and so the claim follows. □ 

As alluded to before, the reduction to the coarse-grained walk, and the resulting time change, 
will need to be performed inside the expectation of random variables 

2t k 

Rn,k ■= £ 1<(^)1{^6B£}' ( 3 " 15 ) 

i=t k 

which — as we will demonstrate soon — will serve as a lower bound on the sum in (12.261) . We 
will need estimates on the first two moments of R n y. 



Proposition 3.4 (Moment bounds) Let d > 4 and suppose p n obeys ( 12.91 ) for some sequence 
{ne}e>\- Let a G (0, OCq]. Then there are ¥ a -a.s. finite and positive random variables C3 = Ct,(co), 
C4 = Ca,{oo) andN2 = A^w) such that for all n G {ni\i>\ with n>N2 and all k satisfying (12.101 ) 
we have 

E° a [Rn,k)>C 3 (G))p n t k (3.16) 

and 

E a {Rl k )<C 4 (co)( Pn t k ) 2 . (3.17) 

The proof of these bounds is deferred to Sections |U|7] We will now show how the ingredients 
assemble in the proof of the Key Lemma: 



Proof of Key Lemma from Proposition 13.41 It is clear that it suffices to prove the statement for 
P a -a.e. CO and all a > sufficiently small, because the support of P can be covered by the union 
of supports of P ar for some a r 1 0. We will assume throughout that n G {nt}i>\. 
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Let a G (0, Oq] and let ft) be such that there is a unique infinite connected component ^ a 
whose complement has only finite connected components. Let j3 = j8(a,<i) be as in Lemma [33] 
and suppose (13.121 ) is valid for this ft). Assume also that Ct,{(0), C^(co) and Nz((o) from Proposi- 
tion [33] are finite and positive. Consider the event 

2t k 



|£7><2j3^|. (3.18) 



Now define C2 and N\ as follows: Let C2(ft)) := lCs((o) and let Ni(co) denote the least inte- 
ger n'> N 2 ( ft)) V e 8 ^ such that 

tk > e (iogiogn') 2 ^ C 3 (fl))> Vc 4 (ft))P°(4 c ). (3.19) 



Clearly, iVi (ft)) < °° because P^(^ c ) — ► as — ► 00 holds for ft). 

Having made the necessary definitions, we can now get to the actual argument. A starting point 
is to notice that, for the paths of the random walk X belonging to S k and k such that 2fit k < n l%— 1, 
the sum in ( 12.261 ) can be bounded below by R nk , 

*4 L 1 {JQefl|}l^(X/) >Rn,0-g v (3.20) 

The upper bound in (12.101 ) shows that 2/3 < 'V2 — 1 once n > e 8 ' 3 , and so for n > N2 it suffices to 
derive the desired lower bound for E% (R n jc\g k ) instead. For this we introduce 

^n,k--={Rn,k<Mp n t k }, (3.21) 

where M > is a number to be determined momentarily, and write 

E° m (R nJc lg k ) > £° (R„jcU t n^ nj! ) 

where we also used that R, uk > 0. 

It remains to estimate the three terms on the right-hand side of ( 13.22I ). From the definition 
of ^ n ,k we immediately have 

E° m (R„, k l^J < Mp n t k R%{^). (3.23) 

For the last term in (13.22I ). since n > Ni(co) > A^(o) and k obeys (12- 10b . the Markov inequality 
and Proposition I3.4l yield 

El (R n , k l Kk ) < ^jEl «*) < (3-24) 

Along with (13.161 ) this shows that all three terms on the right-hand side of (13.221 ) are of the same 
order. This permits us to turn (13.221) into 

E m (R„M > (c,(co)-MP° m {S k c ) - < ^-)pnt k . (3.25) 

Now set M := [C 4 (ft))/P^(4 c )]' /2 and note that, by ( 13.191 ) and our choice of C 2 ((o), the term in 
the parenthesis multiplying p n t k is at least C2(ft)). □ 
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4. Moment bounds on R n>k 

At this point, the proof of our main result has been reduced to the moment estimates from Propo- 
sition 13.41 There are generally two types of technical ingredients we will need to invoke in 
both cases: appropriate heat-kernel bounds and estimates on the density of points i£Sjfl ^o,a 
where srf n (x) occurs. To demonstrate the underlying reason for invoking these facts, let us again 
begin by a heuristic argument that explains why the bound on E^(R„^) should hold true. 

Consider the coarse-grained walk X and let P m denote its transition probability on «. Ex- 
plicitly, using the notation (13.51) we have: 

P m {x,y)=P^(X Tl =y), x,y €«.,„(<»). (4.1) 

Then we can write 

E° m (Rn,k)= £ (E^CMWm" (4 " 2) 

xeB° k n^, a \t=t k / 

Since the temporal scale t k and the spatial scale of B k are related by diffusive scaling, and the 
chain X has good mixing properties, it is now quite reasonable to expect that Xg is in the time 
range t k < £ < 2t k more or less evenly distributed over B° k n ^L, a . In particular, the sum over I in 

(14.21 ) is at least of order t\ d ^ 2 , uniformly in x G B° k n ffoo.a- The bound (13.161 ) is thus reduced to 
estimating the lower density of s4 n in B° k n ff^a- 

Unfortunately, the desired lower bound on P e m (0,x) does not seem to be presently available in 
the literature and so we will have to state and prove it here: 

Lemma 4.1 Let d > 2. For each a G (0, 0!o] there is a constant c 3 > and a ¥ a -a.s. finite 
random variable N4. = A^(cd) such that 

2t k 

£Pi(0,,)>c 3 ^ /2 (4.3) 

e=t k 

holds for all x G B^f] a whenever t k > A^(cd). 

(We note that our proof of this lemma produces directly a bound on the sum, not on the in- 
dividual terms.) As already alluded to above, we will need to combine this with the following 
bound on the density of occurrences of s/„ in the set B k n : 

Lemma 4.2 Let d > 4 and suppose that ( 12.91 ) holds for some sequence {ni\e>\. Let a G (0, (Xq\. 
There is a constant C4 = c^{d, a) < °° and a F a -a.s. finite random variable N$ = Ns((o) such that 
for all n 6 {ne}e>\ with n > Ns(0)) and all k with t\ > logn, 

£ \ <{x) >c A p n \B k \. (4.4) 

Deferring the proof of these lemmas to the next sections, we observe that the bound on the first 
moment of R n j is now reduced to two lines: 

Proof of (13.161 ) from Lemma \4.2\ The rewrite (14.21 ) and the bounds (14.31) and (14.41) imply the 
desired estimate with C 3 (to) := c 3 c 4 inf yt \B k \t k d/2 and, e.g., N 2 (od) := e^ 4 ^) VN 5 ((0). □ 
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Next we turn our attention to the second moment of R n ^. It is not unreasonable to expect that 
here we will need some form of upper bounds on the heat kernel and upper bounds on the density 
of vertices where s/ n occurs. Some version of the former is already available: 

Lemma 4.3 Let d > 2. For each a G (0, Ofo], there is a ¥ a -a.s. finite random variable = 
C(,((o) such that for F a -a.e. CO, 



sup Pi(0,x)<^l, />[. (4 . 5) 

*€«.,« (a>) 11 



Proof. This is a restatement of Lemma 3.2 from ifTOll . □ 

We will need to boost this into an estimate on the Green's function associated with random 
walkX. For x,y G ^»,a this function is defined by 

G a (x,y) := £ Pi(x,y) = (1 - f^foy). (4.6) 

e>o 

In order to ease the notation, for any CO and any f,g: l/ — > M with finite supports, let 

(f,g)a>:= £ f(x)g(x) (4.7) 

denote the inner product with respect to the counting measure on ^^a- (A more natural inner 
product to consider would be that with respect to measure 7i m restricted to ^» «. However, the 
above is what naturally comes up in our calculations; conversions to other inner products will be 
the subject of Lemma [631 ) We will then need: 

Lemma 4.4 Let d > 4 and a € (0, OCq]. There are C5 < °°, r\ < °° and a F a -a.s. finite random 
variable N5 = A^(ffl) such that for all n > Ns((0) and all k with t^ > logn, the function 

fk(x) : = l, c /„( x )l{.teB°n'&,, a } ( 4 -8) 

obeys 

|jr— y|>logn 

The proof of Lemma 1441 will require some non-trivial manipulations with off -diagonal heat- 
kernel bounds and is therefore also deferred to the next sections. In order to estimate the sum on 
the right-hand side, we will also need to prove: 

Lemma 4.5 Let d > 3 and suppose that ( 12.91 ) holds for some sequence {«^}^>i- Then there is 
a constant c 6 < 00 and a F-a.s. finite random variable = Ns(co) such that, for all n G {ni}i>\ 
and all lc with t^ > logra, 

\x—y\>logn 
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As we will see in the next section, this will be easy to prove once we have a uniform bound 
on the density of srf n in large rectangular subsets of B k . We will now show how these ingredients 
combine into the upper bound on E^(R 2 k ): 

Proof of (13.171 ) from Lemmas W4\M. 51 Throughout, let us assume that n £ {ne}i>i. Note that, by 
(12.101 ). the condition tt > logra from Lemmas 144144.51 is satisfied. Recall (14.8I ). Writing R 2 nk as 
the sum of fk(Xe)fk(X^) over pairs £,£' with t k <£,£'< 2fjt, the positivity of all terms permits us 
to estimate the sum as twice the same sum with £,£' now obeying t% < I < £' < 1t\. Applying the 
Markov property and reparametrizing by means of s := £' — £ yields 

££«*)<2££ £ Pi(0 ) ^)P;(x > y)/ fc (^(y), (4.11) 

where we also extended the summation ranges of £ and s to infinity. Plugging (14.51 ) for P^(0,x), 
the sum over £ can be estimated by an integral with the result 

E° a {Rl k ) <2C 6 (co)^(t k -\) l - d / 2 (f k ,G m fk) a . (4.12) 

Lemmas 14.41 and 1431 now tell us that, for n > Ns((o) V N(,(co) and tk > logn, the inner product 

is bounded by csPn^ogn)* 1 ^ 2 + c 5 c 6 p^^ + ^ 2 . Now, by (12.101 ) we in fact have (log?i)' )+2 < 
for « 3> 1 and so, by p„log?i > 1 (as implied by (12.91 )), 

(logn)^f < p>g«)"+ 2 ;f < p, 2 ^ /2 (4.13) 

once « exceeds some finite «o- Summarizing, 

(^-l) 1 ^ /2 (A,G ffl A> ft) <2c 5 (l+c 6 )pM (4.14) 

is valid once n>no and t k obeys ( 12. 10b . The desired claim thus follows for the choices A^(to) := 
N 3 (co) VN 5 (a>) Vn andC 4 (w) := 2c 5 C 6 (o>)[l +c 6 ]. □ 



5. Density estimates 

The goal of this section is to derive the necessary estimates concerning the density of occurrences 
of event srf n in B k n and thus establish Lemmas l4~2l and 1431 Both of these lemmas will make 
use of the following claim: 

Lemma 5.1 For numbers 6 n G (0, 1), let Z n \ ,Z n ^, ...be i.i.d. Bernoulli random variables with 
parameter d n . If {n{\ k >\ I s a sequence with d„ k logn^ — > oo as k — > °°, then for any e > 0, 

/ 1 m \ 

E B E IP (e^E z «^( 1 /2,2))<oo. (5.i) 

ne{n k :k>l} m>e(\ogn) 2 ° nm 7=1 

Proof. By the exponential Chebyshev inequality, 

1 m \ 
^— E Z «.;^ 0/2,2) )<2e-^ e «, (5.2) 
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where £ := min{3 — e, V2 — e }• The sum over m is dominated by its lowest term which, by 
d n log n — > 00, tends to zero faster than any polynomial in n. □ 

We begin with the proof of the upper bound which is easier because there one can immediately 
drop the restriction that the points be contained in the infinite cluster. 

Proof of Lemma\4~5\ Suppose d > 3. Let A e (x) =x+ [-£,£] d DZ d and abbreviate A e := A^(0). 
First we claim that, for some Pa-a.s. finite random variable N' = N'(co), 

sup max max — — - £ \ <(z) < 2, (5.3) 

ne{n k ] * eA « ilog«<£<n P n \ A e\ zeAt{x) 
n>N'(w) 

where p' n := P(«e4(0)). To see this, partition U 1 into 6 d -translates of (6Z) d and label these by Zf, 
i = 1, . . . ,6 d . Clearly, it suffices to show the above for Ae(x) replaced by A l e (x) := Af(x) n Zf 
— including the normalization — for each i. Note that the events s/„(z), z G A' e (x), are i.i.d. 
with probability p' n . Now fix n G {n^}f>i, set 0„ := p^, m := |A|| and observe that m 3> (logrc) 2 
when £ > A log«. The probability that the maxima over x and £ in (15.31 ) exceed 2 is then bounded 
by the n-th term in (15.11) . The Borel-Cantelli lemma and (15.11) imply that this will occur only for 
finitely many n G {n^}^>i, P a -a.s., thus proving (I5.3I ). 

Now pick n > N'(co), use D := {2 m : m > 0} to denote the set of dyadic integers and consider 
the sum in the statement of the lemma. We have 

l-Lly-vK- 2- L ]+M d - 2 ' 

1 1 



|jc— y|>logn ^logn^M^^ M<|x-y|<2M 



For a fixed x, we extend the sum over ytoy£ A2M (x) ; since M > I log n, the sum of the indicator 
of £/„(y) is then less than 2p' n \A2M\ < 2p^(4M + l) d . The summation range of x can subsequently 
be extended to A 4 with s := y/ik, which contains Invoking (15.31) . this yields 

r.h.s.of(El<4(^) 2 (2^+l) d £ T^T - ( 5 - 5 ) 

I log n<M<^/T k 

It is now easy to check that the right-hand side is of order p%t} +d . □ 



Proof of Lemma WM Suppose d > 4. For the lower bound we will invoke some more sophisticated 
facts about percolation in d > 3. Let p := P(ft)/, > a) and let M be a dyadic integer such that the 
bond percolation with parameter p in the slab 

M M (£) :={£,£+l,...,e+M-l}xZ d - 1 (5.6) 

of width M contains an infinite cluster almost surely. The existence of such an M is guaranteed 
by Grimmett and Marstrand l25l . In particular, by the uniqueness of the infinite component in the 
slab (e.g., via Burton and Keane iPToll ) the restriction of ^ >a to Mm(£) contains a unique infinite 
connected component ^o )Ce (^) P-a.s. Note that < tC )0( (£) is independent of the edges with at least 
one endpoint outside Wm(£). 

Abbreviate §m := {x = (xi,...,Xd) G (3Z) d : x\ G 3MZ} and let fi/„(x) denote the subset 
of £/ n (x) containing the configurations such that a trap occurs at x with the trap edge (y, z) such 
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that y := x — t\ and z :=x— 2ei. (Here §1 := (1,0, ...,0).) Clearly, 

Z - L L 1 <'w 1 {^%,,„(3£ikf)}- ( 5 - 7 ) 

,reB°n c ^„, a fez jc6B°n§ M 

A key point of the construction is that, conditional on all infinite clusters {^L,a (3£M) : £ G Z}, 
the events {<<'(*) : * G Ufez ^~,« n S M } are i.i.d. with probability p' n := P(^'(0)). 

To estimate the right-hand side of ( 15.71 ), let JF M denote the a-algebra generated by the restric- 
tion of (0 to the union of slabs \J eeZ M.M(3£M) and introduce the (J?m -measurable) quantity 

Qk-=Yj Yj 1 {.ve<&, a (3i'M)}- ( 5 - 8 ) 

eez xeB° k n§ M 

Lemma [57X1 and the aforementioned independence yield 



^ ^ -*Z(*) 1 {*e«., a (3£M)} - ? 
ne{»y} it: t k >\ogn \ rnMk f eZ x€ B° k n§ M 
Qt>(logn) 2 

Therefore, in light of the restriction % > logn, there exists N' 5 = N' 5 (co) such that 

n > N' 5 , n G {rij}, \ 

„ ^ /, n2 im P 1 y L V„(*) > ~Pa&- ( 5 - 10 ) 
^>log7i,&t>(log«) 2 ,refi^k, a 2 

But the Spatial Ergodic Theorem yields Qk/\B° k \ — > y/ G (0, 1), where is (l/S)' 4-1 of the (non- 
random) density of (0) in the hyperplane {x € Z : #i =0}. Hence, there is Afj' = N'^(oo) such 
that 72 > and ^ > logrc forces Q k > \y\B k \ and (by d > 2) also > (log«) 2 . Noting that 
p„ < 2<ip^, the claim follows with N 5 := N' 5 VN% and c 4 := \y/(2d). □ 



< 



JF M < °°. (5.9) 



6. Heat-kernel input: upper bound 

Here we establish the first part of the claims involving heat kernel bounds that are needed in 
the proof of Proposition 13.41 Specifically, we will give the proof of Lemma 14.41 The strategy 
is to convert this to the same problem for the simple random walk on the supercritical percola- 
tion cluster. For this random walk we can apply existing results obtained earlier by Biskup and 
Prescott [13] and Barlow and Hambly l6l . 

Given a G (0, Ofo], let us regard ff^a as a graph with edge set inherited from {b G M d : a>b > cc}. 
Let P a ,(o denote the transition probability for the simple random walk on «(&)) which is the 
Markov chain for the conductances that are set to one for edges in ^L>, a and to zero otherwise. 
Let G a ^ a (x,y) := (1 — P a ,(o) _1 (x,j) be the Green's function for the transition kernel P a , ffl . 

Lemma 6.1 Let d > 3 and a G (0, do]. Then for 6. CO, 

Ga,a(x,y)<-. — -U-t if \x-y\>S x AS y , (6.1) 
\x — y\ a z 

where {S x (co) : x G Ko,«} are random variables satisfying 

F(S X > r\x G V^a) < e -4 '*, r > 0, (6.2) 
for some constants ci,C2,8 G (0,°°). 
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Proof. This is a restatement of the upper bound from Theorem 1 .2 of Barlow and Hambly (6J . □ 
In addition we will need to make the following observation: 

Lemma 6.2 Let d > 3 and a £ (0, Oo]. Then 

E a G ffliCe (0,0) <oo. (6.3) 



Proof. We will plug into explicit expressions derived in 111311 . Let us use 

d a>(x) ■= L !{<%>«} ( 6 - 4 ) 

y: \y-x\ = l 

to denote the degree of x in the graph ^»,a(®)- Let 

denote the continuous-time heat kernel associated with the simple random walk on ^o. a . We 
claim that the random variable 

Ki(a>) :=sup^/ 2 q f (0,0) (6.6) 
f>i 

satisfies EXi < °°. This is seen as follows: By way of our assumption c(d)P((Ob > Ct) < 1 it is 
not hard to check that the random variable Ci so (0) _1 defined in |[T3l Eq. (6.5)] has a stretched 
exponential tail and thus has all positive moments. By lfT3l Proposition 6.1], K\{(0) is bounded 
by a constant times Ci so (0)~ rf and so K\((o) has all moments as well. The observation 

/i CO /i CO 

G ai(X (p,0)=d a (p)J o q ( (0,0)d/<l+2J^i(a))y r d / 2 dt (6.7) 

now proves the claim. □ 
We will use this in conjunction with the following comparison statement: 

Lemma 6.3 Suppose d > 3 and let a € (0, Oo]. Then for P a -a,e. (0 and any (positive!) function 
f: — > [0,°°) with finite support, 

</,6«/>.<(^)V>G«,«/>.- ( 6 - 8 ) 



Proof. Informally, the comparison between the quadratic forms in ( 16.8b is a consequence of the 
fact that the Green's function, being the inverse of the generator of the Markov chain, is non- 
increasing, as an operator, in the conductances. Nonetheless, as the stationary measures for the 
two chains are different, the only way we can see how to employ this observation rigorously is 
by way of introducing an intermediate Markov chain. 

Recall our notation d m (x) for the degree of x in ^ a (o) and consider the following transition 
kernel on ^ a '■ 



P a (x,y) := < 



Kco(x) l al {(0xy > a} , if\x-y\ = l, 

1 — K m (x)~ l ocd m (x) , ifjc = y, (6.9) 

0, otherwise. 
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The distinction compared to the Markov chain described by P ra is that this chain is delayed 
at each x for a time that is geometrically distributed with parameter ad m (x)n m (x)^ 1 . As the 
Green's function counts the expected number of visits to a given point, the Green's function G ffl 
corresponding to P ffl satisfies 

G m ( x ,y) = G aa ( x , y )^L. (6.10) 
ad m (y) 

(This can be also checked directly from (1 — P m )(x,y) = ad m {x)K m {x)~ l (1 — P ra )(x,y) as implied 
by d6.9l >.) The reason for consideration of P ffl is that, unlike P ffl , this chain is stationary and 
reversible with respect to % m . As a consequence of the easy operator bound 

1-P ffl >l-P ffl on ^(^,7^), (6.11) 

we thus have G a < G m on •£ 2 ('^» ) a, 7l m ). Using (f,g)jc m to abbreviate the canonical inner product 
in ^ 2 (^o, a , n m ), for positive functions / we now get 

< W>„ < )V> W)^ (6.12) 

where we used a < 7t m < 2d, d a (x) > 1 and (16.101 ) to get the first, third and last inequalities and 
(16.111 ) to get the second inequality. □ 

We are now ready to assemble the ingredients in the upper bound on E I ^ (R^ 1 k ): 

Proof of Lemma W^4\ First we note that the random variables (S x ) from Lemma loTTl satisfv an a.s. 
estimate. Indeed, a Borel-Cantelli argument shows that for any d > 1/8 there is a P a -a.s. finite 
random variable K = K{(o) such that 

max S x ((o)<\\ogt k ] e , t k >K{co). (6.13) 

Assuming (without loss of generality) 6 > 1 and substituting (16.11 ) when \x — y\ > [logrc] 9 into 
the definition of G a (0 (x,y) we thus get 

I Am<y)GaA*,y) < I « ttt=%l - (6 - 14) 

*,y x,yeB° k nV„, a 1 + l x y\ 

\x-y\>[logn\ a \x-y\>logn 

for some absolute constant C(, = C(,{d). For the pairs (x,y) with |x — y\ < [logn] e , here we drop 
the indicators of srf n (x) and srf n (y) and invoke the standard fact 

G a ,a(x,y) < ^>a,m{x,x) ll2 Qa,miy,y) 12 - (6-15) 
Now we apply Cauchy-Schwarz (still under the restriction \x — y\ < [logn] 9 ) to get 

£ fk(x)fk(y)Ga,a(x,y) < £ Ga,m(x,x). (6.16) 

|*-y|<[logrc] e |x-y|<[logn] e 
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Summing over y yields a multiplicative term of order [logTi]^ 6 defining r\ := dd. The sum over x 
is then estimated using the Pointwise Spatial Ergodic Theorem and the bound on the Green's 
function from Lemma [6^21 by a constant times tf^ 2 , provided is sufficiently large. Combining 
(16.141) and (16.161 ). the claim follows. □ 



7. Heat-kernel input: lower bound 



Our final task in this paper is to establish the lower bound in Lemma I4TT1 Unable to directly plug 
into estimates that exist in the literature, we will have to reproduce the corresponding argument 
leading to Proposition 5.1 of Barlow [3] which is itself based on ideas adapted from Fabes and 
Stroock l22l and Nash [33]. A slight drawback of this route is that we have to work with the 
continuous time version of the chain X. 

Fix CO with G ^,. a and consider the (constant-speed) Markov process X = (X t ) t >o on tf^a 
with generator J£ a ,a> that is defined by 

(&a,wf)(x) := £ Pw{x,y)[f(y)-f( X )], x6t, a . (7.1) 

Alternatively, X, :=X^ t , where N t is the rate-one Poisson process at time t. Let q, (x, y) denote the 
associated heat kernel, 

q t (x,y) := W =y \ x,y G (7.2) 

where, abusing the notation slightly, P^ denotes the law of X with F%(Xo =x) = 1. (The normal- 
ization ensures q t {x,y) = (j, x).) We will need the following estimate: 

Proposition 7.1 Let d>2 and a G (0, OCq]. There are constants c-j > and t, > and a ¥ a -a.s. 
finite random variable Rq = Rq((») such that for ¥ a -a.e. (0 and all all R > Rq((o), 

min q, (0,jc) > c 7 e^ tR ' 2 R d , t > R 2 . (7.3) 

\x\<R 

Before we delve into the proof of this claim, let us see how it implies Lemma |4~T1 
Proof of Lemma \4J\ Consider the quantities 

a n>k := / dte '— . (7.4) 

Then, since X, has the law of X n at n := Poisson(?), 

n a (x) [f k dtq t (0,x) = £ P n m {0,x)a n! k. (7.5) 

By Proposition 17.11 with R := y^, the integral on the left is order t\ d ^ 2 uniformly in x G B^n 
C €^ M - Since a„ t k < 1, P%(0,x) < 1 and K w (x) > a, it suffices to show 

£ fl^<e-* k>l, (7.6) 

n&[tk,2t k ] 
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for some c > 0. Let Z\,Z 2 , . ■ . be i.i.d. exponential with parameter one. Then is the proba- 
bility that Y n \= Z\ -\ hZ M+ i G [|^, |^]. But the mean of Y n is n + 1 and Zi has exponential 

moments. So, by Cramer's theorem (cf, e.g., den Hollander ETl Theorem 1.4]) this probability 
is exponentially small in the distance of n to [|^, |^], which is at least tk/3. □ 

The remainder of this section will be spent on proving Proposition 17.11 In order to appreciate 
better the forthcoming definitions, it is instructive to check how the desired lower bound is de- 
rived for continuous diffusions in uniformly elliptic environments — i.e., diffusions on M. d with 
generator (Lf)(x,t) := Y ii jdi(aij(x,t)djf)(x,t), where <9/ is the partial derivative with respect 
to Xi,i = l,...,d, and where the coefficients a = (an) are uniformly elliptic in the sense that, for 
some X G (0, 1), all x G R d and all t > 0, 

X- X \t;\ 2 < £ ai j(x,t)^j<X^\ 2 , ^GM rf . (7.7) 
'7=1 

Here Fabes and Stroock (cf ll22l Section 2]) invoke an argument of Nash [ 33 ] that goes as follows: 
Let y i — y T a {t,x, y) denote the transition density for the above diffusion started at x and observed 
at time t. Setting 

H z (t):= Jdye-^ 2 lo g r a (t,z,y) (7.8) 

one then shows, via a differential inequality for t i-» H z (t), that H z (t) > c'(X) uniformly for all 
t G [0, 1], all z£R d with \z\ < 1 and all a as above (cf [22, Lemma 2.1]). Invoking the Chapman- 
Kolmogorov equations, 

r fl (2,0,*)> Jd Z e-^ 2 r a (\,0,z)r a (\,z,x). (7.9) 

But r a (l,z,x) = Fa(l,0,x — z) for a denoting the map of a under a linear transformation (shift 
and reflection) of M. d and so by taking logs and applying Jensen's inequality for the probability 
measure e ^1^1 dz (cf ll22l Lemma 2.6]) we get 

logr fl (2,0,x) >2c'(A). (7.10) 

By virtue of shifts and scaling (recall that Y a is a spatial density and the heat equation is invariant 
under the diffusive scaling of space and time), the fact that this holds uniformly in a implies the 
desired claim F a (2t,x,y) > e 2c 'Wt- d / 2 for \x-y\ < y/i. 

There are several technical obstacles that prevent a direct application of this argument to our 
present setting. The three most important ones are as follows: 

(1) Our spatial variables are discrete, so the diffusive scaling cannot be used. 

(2) Our environment is not uniformly elliptic on all scales, so we have to truncate the integral 
in (17.81 ) to "good" regions. 

(3) The derivation of the differential inequality for t \-t H z (t) in |[22l does not carry directly 
over to the discrete setting. 

Fortunately, all of these obstacles have already been addressed by Barlow in his derivation of 
a uniform lower bound on the heat-kernel for the random walk on the supercritical percolation 
cluster; cf (3] Proposition 5.1]. So we just need to adapt Barlow's reasoning while paying special 
attention only to the steps that require modifications due to a (slightly) more general setting. 
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Suppose oo is such that 6 ^L, a and recall that dist(j,(x,j) stands for the graph-theoretical 
distance on ^L>,« between * and j — i.e., the length of the shortest path from x to y over edges 
with co-conductance at least a. For R > 1, let 

K R := {x€^,«: dista,(0,x) </?}. (7.11) 

Introduce the function 

:= /RAdisa^y (712) 



and consider the weighted measure v = V R}(0 defined by 

v(x):=V R l (p(x)K m (x), (7.13) 

I |2 

where V# := £ v <p(x)7ro(x). This v will be the analogue of the probability measure e~ n > y > dy in 
the continuum setting. Fix z G K R and abbreviate 

w z , t (y):=log(V R q t (z,y)), (7.14) 

where y i-> Vsq f (z,.y) is the analogue of the transition density T a . Finally, let 

ff z (0:=£vK,(-)) (7.15) 



play the role of the quantity in (17.81) . A starting point of the derivation of a differential inequality 
for 1 1->- H z (t) is the following bound: 

Lemma 7.2 Abbreviate d>xy '■= 7l m (x)P m (x,y). Then for any z € K R , 

V R jH z {t)> 1 - £ (^(p(x)Ap(y))[w^x)-w z , f (y)] 2 

9WA 9 (y) (7 - 16) 



7 £ EMwfi-^H)- 



Proof. This is proved by literally following the calculation that begins at the bottom of page 3070 
and ends on line (5.9) on page 3071 of O. The fact that we used dist ffl instead of (perhaps more 
natural) d' ffl -distance is immaterial for the calculation. □ 

Next we will estimate the terms on the right-hand side of (17- 16b - Notice that, by (13.21) . the 
graph-theoretical distance disto(0,x) and the ^-metric \x\ are commensurate on ^..a- In partic- 
ular, K R is contained and contains ^-balls of radius of order R and, by ergodicity of the infinite 
cluster, V R thus grows proportionally to R d as R — > oo. 

Lemma 7.3 There is c$ < oo and a ¥ a -a.s. finite random variable R\ = R\ (oo) such that 

^ (J^t^ VJr2 R > Ri{a) (71?) 
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Proof. Let x,y £ K R and set k := dist ffl (x,^) A dist ffl (y, K R ) and s := dist m (x,y). Then |<p(x) 
(p(y)\ < + and so, since > 1, 



( 9 W - 9(y)) 2 < / ^2 < a/r^st^,,) 4 . (7.18) 



Therefore, 



where 



(p(x)A<p(y) 

Lh.s. of (Eni) < 9R 2 £ /j«ot v (w). (7.19) 



h a ((0):=l {Q ^ a] £ c^dist^z) 4 . (7.20) 

Bounding dist ffl (0,z) < diam ra (%) for (&o iZ > 0, using that diam ft) (%) has all moments by Propo- 
sition ET24) and noting that the sum over equals 7r ffl (0) < 2d, we have Eh a (co) < °°. Since 
(13.21 ) permits us to dominate the sum over x £ Kr by that over a cube of side proportional 7?, the 
Spatial Ergodic Theorem shows that the sum in ( 17.191 ) is bounded by a constant times Vr once 7? 
exceeds a random quantity 7?i (ft)). □ 

Lemma 7.4 There is a constant eg < °° and a F a -a.s. finite random variable R2 = Ri((o) such 
that for all z, 

£ £ d) xy( p(x)(l-^\) <c 9 V R R-\ R>R 2 ((0). (7.21) 

xeK R yeK< R V qt{Z,x)J 

Proof. Since dist ffl (x,/v^) < dist a) (x,y) whenever x £ Kr and y € iv|, we can dominate 

(p(x) < R- 2 dista,(x,y) 2 < R- 2 dist w (x,y) 4 (7.22) 

for each pair x, y contributing to the sum. Dropping the ratio of the q r terms, the result is estimated 
by the right-hand side of (17- 19b - □ 

Corollary 7.5 For R> /?i(ft))V Rj(co) andallz^KR, thefunction 

t^H z {t) + ^{c8 + c 9 )R- 2 t (7.23) 

is non-decreasing on [0,°°). 

Proof. Let h(t) := H z (t) + \(c% + c 9 )R~ 2 1 and note that, by Lemmas 17311711 V R h'(t) exceeds the 
first term on the right-hand side of (17.161) . In particular, h'(t) > 0. □ 

The bounds on the last two terms in ( 17.161 ) suggest that perhaps also the first term should be at 
least of order R . This is indeed the case thanks to: 

Lemma 7.6 (Weighted Poincare inequality) For any a G (0,ao]» there is a constant c\q = 
cio(a,d) > and a ¥ a -a.s. finite random variable R3 = R^(co) such that 

VE^^W^W) [m-f(y)} 2 > c 10 /T 2 Var v (/) (7.24) 

x,y 

holds for any R>R3 and any function f: ^^(ft)) —> M. with support in Kr. 
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Proof. We will reduce this to the corresponding statement in [3j Theorem 4.8]. Consider the 
collection of conductances defined by 

®xy := l|jc-y| = ll{ro JT >a}l{A-e c «Ca}> x,y G Z d , (7.25) 

and let fc(x) := ^jc)!^^ i<fa>(*)> where we recall the notation (16.41 ) and where Vr is the 
number that makes k a probability measure. By Theorem 4.8 and the fact that Kr is "very good" 
(in the language of 0) once R exceeds a random quantity R^(G)), we have 

V R { Y<®x>{v(x) ^V(y))[f( x ) ~ f(y)Y >^ R ~ 2 V*r K (f) (7.26) 

x,y 

for some constant £4. > 0, provided that 7? > R-i((o). (Here is where it is essential that (p is defined 
using the distance measured on the percolation graph The bound d w (x) > (2d) 7l m (x) 

for x G Ko, a yields 

Var K (/) :=£>(*)[/(*) -E K (f)] 2 

£ |i? VW [/W-^/)] 2 s|^Var v (/), (7.27, 

where we noted that the second sum is further decreased when E K (f) is replaced by E v (f). 
(Namely, a i->- E((Z— c?) 2 ) is minimized by a = EZ.) Since ft)^. < a -1 co^, (17.26jT7T27l) now yield 
the claim with cio := ce(2rf) _1 C4. □ 

The core part of the calculation is now finished by noting the following fact: 
Lemma 7.7 Let c = c(T,R,z) be defined by c(T,R,z) '■= sup r>7 - sup v q f (z,j)Vr. Then 

Var v (w,, f ) > [l ° g£ ~" M \ pU^z,X t ) < f«) -9e 2 +^)) (7.28) 
holds for all t >T and all z G «. 

Proof. This is justified by following the calculation in displays (5.8-5.9) of [3 ] just stopping short 
of substituting the explicit bound (5.2) at the very last step. □ 

Now we are ready to start constructing the proof of the lower bound on q f (0,x). Suppose d>2. 
First we notice that we do not need to prove the desired claim for all t >R 2 and \x\ < R; it suffices 
to prove it for t a constant multiple larger and \x\ a constant multiple smaller than is dictated by 
these bounds. (We will find it is easier to prove this using ^-distances; hence the formulation 
using those). 

Lemma 7.8 Let a G (0, OCq]. There exists a ¥ a -a.s. finite random variable R(, = Re((o) and 
a constant c = c(d,cc) G (0, 1) such that the following is true: If for some decreasing function 
s t— >• j3(j) G (0, 1), a constant f] G (0, V2) cind all integers R > R(,, 

min min q t (x, y) > p(tR~ 2 ) R~ d , t>R 2 /ri, (7.29) 

\x\~<R \y-x\«,<r)R 
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then for all R > AR^/r], 

min min q t (x,y) > \cri d f5(tR^ 2 )] 4/l1 R d , t>R 2 . (7.30) 

It is worth noting that this does not follow by a simple rescaling of R. Indeed, to reduce the 
lower bound on the range of t one has to also reduce the separation between x and y. 

Proof of Lemma ED Abbreviate A N (x) :=x+ [-N/2,N/2] d n Z d and set p := P(0 G tf^a)- 
By the Spatial Ergodic Theorem, there exists a random variable R' 6 = R' 6 (co) such that, for all 
N > R' 6 , every box A#(jc) with x G (NZ) d and |x|oo < 4N/7], will contain at least ^A^ d vertices 
of ^oo, a . Set 7? 6 := AR'Jr], pick /? > R 6 and note that AT := [riR/3\ > R' 6 - Now pick x,y £ ^>, a 
with \x\oc < T]R and |v — x|oo < R and let zo, ■ ■ ■ ,z m G (NZ) d be a path such that 

|zi|<»</? and — Zi\oo = N , i = l,...,m— 1, (7.31) 

and 

|jc-zo|~ < A 7 and |y < JV. (7.32) 
It is not hard to check that such a path exists forl/r] < m < A/f] — 1 . By Chapman- Kolmogorov 
and the fact that K m (-) > a on ^ «, 

m— 1 

q,«^,y)>a m £ ... £ n^'^+i) ( 7 - 33 ) 

*ieA iV (zi)n < Ko, a x m eA N (zm)n^ a ;'=o 

where xo := x and x m+ i := y. Since 

< |x/ - Zi|co + |z/-Zi+i|=o + |x,- + i -Zi+i|oo < 3A/ < r//?, (7.34) 

we are permitted to apply the lower bound ( 17.291 ) to each term in the product. Along with the 
bound |Ajv(z m ) H'tf^.al > \pN d ', this yields 

q m ,(x,y) > {\a P N d ) m {P{tR- 2 )R- d ) m+ \ t>R 2 /r]. (7.35) 

Writing t for mt, invoking the monotonicity of t h >• fi{tRT 2 ) and the bounds l/rj < »i+ 1 < 4/t] 
the claim follows with c := ^(Xp3~ d . □ 

Our task is thus to establish the premise (17.291) of the previous lemma. We begin by recalling 
the following bounds from lfl3l : There is a P a -a.s. finite random variable to = to(co) and constants 
C5,C6 < °° such that for Pa-a.e. (0 and all t > to(co), 

sup sup q,(z,x) < c 5 r d/2 (7.36) 

zeK, xetf^a 

and 

sup^dista,(z,X f ) < c 6 Vt- (7.37) 

These are implied by |[T3l Propositions 6.1,6.2] via the argument (6.33-6.37) in |[T3l and also the 
fact that the graph-theoretical distance, the Euclidean distance and also the distance associated 
with the Markov chain X on are commensurate; cf Proposition 13.11 3-5). Incidentally, the 
latter also yields 

V R <c 1 R d , R>R 4 ((0), (7.38) 
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for some constant c 7 < oo and a P a -a.s. finite random variable R 4 = R 4 (co). 

l 

2 



Introduce the constants c := csc-i{2Ac( > ) d and c' := Acio(144c) 1 and define 



7 := (2| log c\) V (2 + log36) V V J^^-- (7-39) 



Then we have: 



Lemma 7.9 There is a ¥ a -a.s. finite random variable Rj = Rj((o) such that for all R > Rj((o) 
andT := R 2 /(24c 6 ) 2 , 

sup H z (t)>-y, zeK R/2 . (7.40) 

T<t<2T 

Proof. Setting Rs((o) := sup{7? > 0: T < t (a>)}, define /?7(to) := max (= i i5 /?;(&)). First we 
note that, for t G [f (co),2r] and z G £r/ 2 , (17.371 ) implies 

~ 6 //" 1 

^(disto,(z,X,) > §/?) < (dist^z,*,) > \R) < c 6 -£- < -. (7.41) 

By (17.36b , Lemma 17771 holds for our choices of T and c, and ( 17.281) then becomes 

Var v(wv ) > [l0gC ~"^ )]2 (1 - , (7.42) 

provided that R>R 4 VR 5 ,z€ K R/2 and f G [r,2J]. 

Suppose now that the supremum in (17.401 ) is less than — y for some z G £r/2- Then (by y < 
2 + log 36) we would have e 2+H M < i/ 36 and, by way of the fact that (a-h) 2 > \h 2 holds 
whenever h < —2\a\, also [logc — H z (t)] 2 > jH z (t) 2 . The right-hand side of (17.421) would then be 
at least (U4c)- l H z (t) 2 for all t G [T,2T\. If R > R 7 , Lemmas 17721 17731 17741 1731 then give 

H' z (t) > 2c'R- 2 H z (t) 2 - ^(cs + c 9 )R- 2 > c'R- 2 H z (t) 2 , t G [T,2T\. (7.43) 

But integrating over t G [T,2T] (and using that H z (t)~ 2 stays bounded throughout by the assump- 
tion that the supremum in (17.401 ) is less than — y) yields 

H z {Ty l >c'TR- 2 + H z {2Ty l >c'TR- 2 -^>0, (7.44) 

thus contradicting the assumption that H(t) < —y < for all t G [T,2T]. □ 



Proof of Proposition \7.1\ As in the continuous setting, a key part of the proof is to show a 
(linear) lower bound on H z (t). Let R' := R\ V R 2 V R 4 V Rj, abbreviate £ := |(cs +c<f) and recall 
our notation for T above. By Lemma [7791 there is a t' G [T,2T] for which H z (t') > —y. The 
monotonicity of t h > H z (t) + C,R~ 2 t (cf Corollary 17.51 ) shows 

H z (t)>H z (t')-i;R- 2 {t-t'), t>t', (7.45) 

and so 

H z (t) > -y- &R- 2 (7.46) 
holds for all R > R' (co), all t > 2T and all z G K R/2 . 
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To see how this implies the desired claim, we invoke the Markov property, reversibility and the 
fact that (p(x) <\ to get 

V R q 2t (x,y)>£V R q t (x,z)V R q t (y,z)v(y). (7.47) 

z 

Taking logs and applying Jensen's inequality, this becomes 

log(V R q 2t (x,y)) > H x (t) +H y {t) > -2y-2^RT 2 t, x,y £ K R/2 , (7.48) 

whenever t >2T and R > R' . By the domination dist ffl (x,v) < d\x — v|oo, the ball K R / 2 contains 
all vertices x G ^L, jCe with |x|«, < R/ (2d). A simple rescaling of R then yields (17.291 ) with /3 (s) := 
c^ l (2d)- d &- 2 y-^l {2d ^\ v\ := 36cj/d 2 and/? 6 :=R' /(2d). Invoking Lemma EH (if V < 1) and 
the comparison \x\ < \x\oo, the claim follows. □ 
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